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1.  Introduction  and  Summary 


There  are  at  least  three  separate  approaches  to  testing  hypotheses 
based  on  ranks  In  the  linear  model.  Three  of  these  methodologies,  scattered 
through  the  literature,  are  described  by  McKean  and  Hettmansperger  (1976), 
Sen  and  Purl  (1977)  and  Adlchle  (1978).  In  addition,  we  will  Introduce  a 
fourth  approach  In  this  paper  based  on  a  suggestion  of  Blckel  (1976)  In 
the  context  of  M-estlmatlon.  All  of  these  tests  have  the  same  approximating 
distribution  under  the  null  hypothesis  and  the  same  asymptotic  efficiency. 
Other  than  to  note  the  asymptotic  similarities  there  has  been  no  previous 
attempt  to  study  the  similarities  and  differences  among  these  tests  for 
small  to  moderate  sample  sizes.  In  particular,  there  have  been  no  sugges¬ 
tions  for  users  on  which  of  these  methods  Is  the  more  practical.  Recommen¬ 
dations  on  the  Implementation  of  these  methods  can  be  found  at  the  end  of 
Section  5. 

In  Sections  2  through  4  we  provide  a  unified  discussion  of  all  four 
tests  In  the  context  of  the  geometry  of  the  linear  model.  By  considering 
the  geometry  of  the  statistics  we  can  quite  easily  describe  differences 
and  similarities  In  the  tests.  In  addition  to  providing  a  comparison  of 
the  rank  tests,  the  geometry  suggests  a  comparison  with  the  classical 
F-test.  There  are  three  algebraically  equivalent  forms  of  the  F  statistic 
and  the  rank  tests  can  be  Identified  with  these  different  forms.  The  rank 
tests,  however,  are  not  algebraically  equivalent  and  this  Is  one  source  of 
their  small  sample  differences.  For  an  excellent  account  of  the  geometry 
of  the  linear  model,  see  Arnold  (1981). 


V 
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In  Section  5  we  investigate  In  a  Monte  Carlo  study  the  small  sample 
levels  and  powers  of  these  tests  along  with  the  F-test.  The  study  In¬ 
cludes  several  designs  and  error  distributions.  On  the  basis  of  this 
study  we  conclude  that  some  of  the  tests  seem  to  be  unusually  sensitive 
to  the  design  and  to  the  error  structure. 

The  new  approach  to  testing  described  here  Is  based  on  Blckel's  (1976) 
Idea  of  pseudo-observations.  The  pseudo-observations  are  constructed  from 
rank  estimates  of  the  parameters  In  the  linear  model  In  such  a  way  that 
the  F-test  calculated  from  these  pseudo-observations  Is  a  normalized 
quadratic  form  in  the  rank  estimates  which  can  be  used  to  carry  out  hypo¬ 
thesis  tests.  A  plot  of  the  pseudo-observations  versus  the  data 
Illustrates  the  effect  which  robust  methods  have  on  the  observations. 

This  plot  is  Included  with  the  example  In  Section  7. 

Both  rank  and  signed  rank  tests  can  be  constructed  and  this  may  be 
a  source  of  some  confusion.  Although  they  differ  numerically,  their 
asymptotic  theory  Is  the  same.  To  avoid  more  notation  we  present  only 
the  signed  rank  versions.  If,  In  the  formulas  for  each  test  described 
In  Section  3,  we  replace  the  signed  rank  score  of  the  absolute  value  of 
the  residual  (defined  under  2.8)  by  the  centered  rank  score  of  the 
regular  residual  and  replace  the  design  matrix  by  the  mean  centered 
design  matrix  we  have  the  corresponding  rank  score  version  of  the  test. 

In  the  case  of  rank  scores  the  Intercept  parameter  Is  handled  separately. 
One  estimate  of  the  Intercept  Is  a  location  estimate  computed  from  the 
residuals.  McKean  and  Hettmansperger  (1976)  and  Sen  and  Purl  (1977) 
discuss  only  the  rank  score  tests  while  Adlchle  (1978)  discusses  both 


rank  and  signed  rank  score  tests.  The  Monte  Carlo  study  Includes 
both  types  In  order  that  their  small  sample  behavior  can  be  compared. 

A  final  note  needed  for  Sections  2  and  3  concerns  estlmablllty. 
In  this  paper,  a  function,  X'3,  of  regression  parameters  is  estimable 
provided  X  lies  in  the  row  space  of  the  design  matrix;  see  McKean  and 
Schrader  (1980)  for  a  discussion  of  estlmablllty  in  terms  of  robust 
estimation. 
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2.  Estimation  In  the  Linear  Model 

2.1  Notation  and  Assumptions 

Let  y  denote  an  n  x  1  vector  of  observations.  Assume  It  follows 
the  linear  model 

Y  -  X0  +  e  (2.1) 

where  X  Is  an  n  x  r  matrix  of  known  constants,  3  Is  an  unknoim  r  x  1  vector 
of  parameters,  and  e  Is  an  n  x  1  vector  of  lid  random  errors,  symmetrically 
distributed  about  0  with  density  function  f(x).  Let  Q  denote  the  subspace 
of  which  Is  spanned  by  the  columns  of  X.  Assume  the  dimension  of  H  Is 

p  £  r.  Then  alternately  we  can  write  the  model  (2.1)  as 

Y  =•  e  +  e,  9  e  (2.  (2.2) 

When  expectations  exist  EY  ■  9;  in  any  case,  Y  is  symmetrically  distri¬ 
buted  about  9. 

For  vectors  y,  z  e  R°  let  <y,  denote  the  usual  inner  product; 

so  <y,  z>  =  vectors  are  orthogonal  when  their  inner  product  is 

0.  Let  i)-  denote  the  orthogonal  complement  of  fi,  the  collection 

of  vectors  in  r”  which  are  orthogonal  to  all  vectors  in  12. 

L/  2 

Denote  the  usual  Euclidean  norm  by  Hy I  Ils  *  '  Least 

squares  procedures  are  then  based  on  this  norm.  Procedures  based  on  ranks 

use  another  norm  which  involves  a  set  of  scores  a(l) . a(n).  These 

are  often  generated  by  a  non-negative,  non-decreasing,  square  integrable 
function  4>(u),  0  <  u  <  1,  by  setting  a(l)  ■  $(1/ (n+1) ) .  Without  loss  of 
generality,  /<t>^  *  1.  Scores  that  are  frequently  used  are  the  sign  scores, 
i^(u)  ■  1,  and  the  Wilcoxon  scores,  <^(u)  «  S'*'  u.  Let  Rly^l  denote  the 
rank  of  ly^|  among  ly^^l,  ...,  IYjjI  J  then  for  a  given  set  of  scores 
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the  function  on  defined  by 

I |y| Ir  “  <a(R|y|),  |y|>  (2.3) 

-  DaCRiyJ)  jyj 

Is  a  norm  on  R° ;  see  McKean  and  Schrader  (1980).  Note  that  for  sign  scores, 
I  |y|  Ir  the  Lj^-norm.  In  general  we  refer  to  (2.3)  as  the  weighted 
least  absolute  deviation  norm  (WLAD-norm) ,  and  the  weights  depend  on  Che 
size  of  the  absolute  residuals. 

2.2  Prediction  and  Estimation 

Let  11*11  represent  any  norm  on  R^.  A  prediction  of  Y  or  estimate 

/V  /N 

of  0  Is  defined  as  a  point  6  In  Q  closest  to  y,  chat  is,  6  satisfies 

||y  -  9||  »  min  ||y  -  e||,  6  e  0.  (2.4) 

Such  a  point  always  exists.  For  the  Euclidean  norm  |  hills’  ®  least 

squares  projection  of  y  onto  Q  which  we  will  denote  by  0^-.  For  the  norm 
I  I* 1 Ir,  we  will  call  9  a  best  rank  or  R-predlction  of  Y  and  denote  it  by 
0j^.  Computation  of  predictions  is  discussed  In  Section  4. 

^  A 

When  the  gradient  V| |y  -  X8( (  exists,  9  *  Xg  is  determined  by  the 
equation 

7| (y  -  X6| I  -  0.  (2.5) 

In  the  case  of  least  squares  (2.5)  represents  the  linear  normal  equations 
-2X'(y  -  Xg)  »  0 

-1 

which,  in  the  full  rank  case,  results  in  the  estimate  g^^^  *  (X'X)  X’y. 

Note  Chat  Che  least  squares  residual  vector  Is  orthogonal  to  Q,  l.e. 

(y  -  9^g)  e  fi-t.  (2.7) 

In  the  case  of  the  weighted  least  absolute  deviations  norm  the  gradient 
exists  at  all  but  a  finite  number  of  points.  The  corresponding  non-linear 
equation  Is 
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V|  ly  -  xel  Ij^  -  -X*  a+(R|y  -  XB|)  -  0  (2.8) 

where  a+(R|y^  -  e^|)  -  a(R|y^  -  0^|)  sgn  (y^  -  0^)  for  i  -  1 . n. 

From  (2.8)  It  follows  that  a  best  R-predlctlon  Is  determined  so  that 

/\ 

the  vector  of  signed  rank  residuals  a'*'(R|y  -  6^|)  Is  orthogonal  to  ^2,  i.e. 

a'^(Rly  -  9r1)  e  (2.9) 

For  the  R-estlmates,  the  minimum  distance  of  y  to  My  -  9|  L  Ij 
unique.  Although  Che  WLAD- estimate  is  not  unique,  under  regularity  con¬ 
ditions  the  diameter  of  Che  solution  set  tends  to  0  in  probability  quite 
rapidly,  see  Jaeckel  (1972). 

The  gradient  (2.8)  consists  of  signed  rank  statistics  appropriate  for 

testing  Che  various  components  of  B.  The  gradient  equation  yields  estimates 

which  can  be  considered  as  extensions  of  the  rank  estimates  of  location 

1/2 

proposed  by  Hodges  and  Lehmann  (1963).  If  4>(u)  3  u  and  X  is  the 

vector  of  n  ones  then  (2.8)  becomes  the  Wilcoxon  signed  rank  statistic 
and  S  is  the  median  of  the  n(n  +  l)/2  pairwise  averages  of  the  observations. 
A  motivation  for  replacing  the  LS  norm  with  the  WLAD  norm  is  that  the 
Influence  of  an  aberrant  y  point  is  bounded  in  the  case  of  B^  and  unbounded 
in  the  case  of  B^ „.  This  type  of  robustness  is  discussed  by  Hampel  (1974) 
and  does  not  extend  to  protection  against  aberrant  design  points.  Another 
motivation  is  the  Increased  estimation  efficiency  discussed  next. 

2.3  Asymptotic  Theory  for  Estimation 

For  Che  full  rank  case,  under  suitable  regularity  conditions  (see 
Huber  (1973),  Jaeckel  (1972),  Jureckova  (1971),  Kraft  and  Van  Eeden  (1972)) 
Che  estimates  B  derived  from  the  LS  or  WLAD  norms  are  approximately  normally 


distributed  as 


(2.10) 


e  %  MVN(e,  K-^cx'x)"-^). 

2  2 

For  least  squares  K  ~  a  ,  Che  variance  of  Che  error  distribution.  For 

2  2 

Che  WLAD  estimate  K  >  t  where 

f'[F~^({2“^(u  +  l)})]/f[F“^({2“^(u  +  l)})]du.  (2.11) 

Following  Blckel  (1964)  the  efficiency  of  relative  to  is 
a  /t  .  In  the  case  of  Wilcoxon  scores  a  /t  *12  a  (/f  (x)dx)  which  is 
bounded  below  by  .864  and  may  be  arbitrarily  large,  depending  on  F.  When 
F  is  Che  normal  cdf  the  efficiency  for  Wilcoxon  scores  is  .955.  See 
Lehmann  (1975  Sections  2.4  and  4.3)  for  a  further  discussion  of  the  efficiency 
Hence  we  find  that  Che  WLAD  norm  produces  rank  estimates  which  are  generally 
more  efficient  Chan  the  least  squares  estimate,  at  least  for  error  distri¬ 
butions  with  tails  heavier  Chan  those  of  Che  normal  distribution. 
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3.  Testing  In  the  Linear  Model 
3.1  General  Linear  Hypotheses 

For  the  model  (2.1)  we  are  Interested  In  general  linear  hypotheses 
of  the  form 

H  :  H6  “  0  versus  H.  :  H6  0  (3.1) 

O  A 

where  H8  Is  a  collection  of  q  linearly  Independent  estimable  functions. 

Let  be  the  (p-q)  dimensional  subspace  of  constrained  by  the  hypothesis 

H6  »  0.  In  terms  of  the  model  (2.2)  we  are  testing 

H  :  0  e  versus  H,:  0  e  -  Q  .  (3.2) 

o  o  A  o 

We  will  call  the  model  (2.1),  0  £  the  full  model  and  the  model  with  the 
constraint  0  e  (2  the  reduced  model. 


3.2  Tests  based  on  minimum  distances 


Given  a  norm  |  |'  |  |  on  R°,  tests  of  (3.2)  can  be  naturally  constructed 

by  comparing  the  distances  between  y  and  the  two  subspaces  Q  and  ^2^.  If 

0^,  0  are  the  best  reduced  and  full  model  predictions  of  y,  for  the  norm, 

then  the  minimum  distances  are  | | y  -  | |  and  | | y  -  0 | | . 

If  the  norm  Is  | |’ {  then  the  comparison  is  between  the  square  of 

'^2  ^2 

the  minimum  distances,  ||y-9  IItc”  l|y“®liTc'  Asymptotic  distribution 

2 

theory  suggests  standardizing  this  difference  by  an  estimate  of  a  ,  the 
variance  of  the  error  distribution.  The  usual  F-test  is 


F 


LS 


l|Y- 


|Y  -  0|| 


LS 


(3.3) 
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2  ^2 

where  *  | | y  -  6 | |  / (n  -  p) .  Under  H  and  regularity  conditions,  see 

IrfO  O 

2 

Huber  (1973),  asymptotic  x  (q)  distribution.  The  usual  small 

sample  test  compares  F  with  F(q,  n  -  p)-critical  values,  rejecting  H 

laO  O 

at  approximate  level  a  if  F  ^  F(a,  q,  n  -  p).  Note  also  that  although 

9^  and  9  are  derived  from  I  1’ I l^g  Che  notation  has  been  suppressed. 

If  the  norm  |  |' ( L  is  used  then  a  natural  comparison  is  between  the 
K 

minimum  distances,  |  |y  -  9  |  |„  -  My  ~  asymptotic  distribution 

theory  developed  by  McKean  and  Hettmansperger  (1976)  leads  to  standardizing 
this  difference  by  an  estimate  of  T,  (2.11),  resulting  in  the  test  statistic, 

|Y-  ejl^-  My-  6| 

q(^/2) 

where  x  is  an  estimate  of  T,  for  Instance  (4.2).  Under  and  regularity 

conditions  (see  McKean  and  Hettmansperger  (1976))  qD  also  has  an  asymptotic 
2 

X  (q)  distribution.  For  small  samples,  the  level  of  the  test  seems  to  be 
more  stable  if  the  F(a,  q,  n  -  p)  critical  point  is  used;  see  Hettmansperger 
and  McKean  (1977)  and  Section  5  of  the  present  paper.  Hence,  the  test 
rejects  H^  at  approximate  level  a  if  D  ^  F(a,  q,  n  -  p). 

Because  of  the  close  relationship  between  the  inner  product  <•,•> 
and  the  norm  M  *  Mtc  ^  statistic  (3.2)  can  be  written  in  two  other 
algebraically  equivalent  forms.  Other  rank  tests  for  the  linear  model 
can  be  identified  with  these  alternative  forms  of  the  F;  however,  since 
they  are  based  on  the  norm  j  |‘Md»  algebraically  equivalent. 

We  next  discuss  these  other  forms  of  the  F-test  and  the  corresponding 


(3.4) 


rank  tests.  For  convenience  we  will  assume  X  has  full  column  rank. 
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3.3  Tests  based  on  Che  full  model  estimates 


This  form  of  the  F  statistic  can  be  derived  from  (3.3)  using  the 
Pythogorean  theorem.  Let  be  the  orthogonal  complement  of  in  (2; 


o 

thai  ||y  -  5^1  1^5  -  ||y  -  611^5  -  l|rj,jy|lLs 


LS 


Ils’ 


where  denotes  the  projection  of  y  onto  .  Hence 

'  o  ° 


F  = 


'  ^(2 1  I  I  LS 


qa‘ 


(3.5) 


From  (3.1),  (2^  is  determined  by  H3  =  0.  Assume  X  is  a  basis  matrix  for  (2 
then  e  =  (X'X)“^X'e  and  H(X'X)“^X'e  »  0  for  every  6  e  It  then  follows 

that  Z  =  X(X'X)”^H'  is  a  basis  matrix  for  (2|Q^  and  y  =  Z(Z'Z)~^Z’y 


When  the  definition  of  Z  is  combined  with  X*  (3’ 5)  becomes 


„  ^  (HB)'[H(X'X)~^H']'^(HB). 
qa 


(3.6) 


This  is  the  coordlnatlzed  version  of  (3.3)  and  expresses  Che  numerator  in 
terms  of  Che  full  model  least  squares  estimates. 

For  the  corresponding  rank  test,  replace  the  least  squares  estimate 

/s  ''2  ''2 

in  (3.6)  with  the  rank  estimate  B^  and  replace  a  by  x  .  Standard  asymptotic 
theory  based  on  (2.10)  shows  that  q  times  this  test  statistic  is  approxi¬ 
mately  chi  squared  with  q  degrees  of  freedom.  Preliminary  simulations 


Indicate  that  the  probability  of  a  type  I  error  is  better  controlled  if 
2  ''2 

T  is  estimated  by  x  ,  (4.2),  which  includes  the  bias  correction  similar 
to  chat  used  in  least  squares.  The -test  is  carried  out  by  referring 


B  ,  (HBj^)'[H(X'X)“^H']  ^(H6j^) 

qx^ 


(3.7) 
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to  an  F(a,  q>  n  -  p)  critical  point.  In  the  numerator  3^  can  be  replaced 

by  3^*^^.  (4.1),  the  k-step  rank  estimate. 

Blckel  (1976)  defined  a  pseudo-observation  based  on  an  M-estlmate 

and  described  how  these  pseudo-observations  could  be  used  In  a  least  squares 

program  to  yield  a  robust  test  of  hypothesis:  see  Schrader  and  Hettmansperger 

(1980).  We  now  adapt  this  Idea  to  the  WLAD  or  rank  estimate  approach. 

^  Given  the  full  model  estimate  9^^  =  the  pseudo-observation 

y  by: 

y  -  XBj^  +  Xa‘^(Rly  -  X3j^|)  (3.8) 


-  +  Xa+(R|y  -  Q^\) 

where  X  Is  a  constant  to  be  determined.  Note  that  since  a‘*‘(R|y  -  6j^|)  £  fi-*- 

from  (2.9),  the  least  squares  estimate  of  3  based  on  y  Is  The  least 

R 

%  ^ 

squares  variance  estimate  based  on  y  -  6^^  Is 


''2  ^2 

then  a  =  t  ,  the  proper  denominator  In  (3.7).  In  the  case  of  Wllcoxon 

^  1/2 

scores,  X  *  x [ (n  +  1)/ (n  -  1) ] 

Hence  we  can  compute  B  In  (3.6)  quite  easily,  as  follows:  1.  find 

^  ^  «  f\, 

3^  (or  0^)  and  x,  2.  construct  y  and  3.  use  y  In  a  least  squares  AOV  program. 
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The  numerator  of  B  Is  the  appropriate  line  of  the  AOV  table  and  the  denomi¬ 
nator  Is  the  error  line.  The  pseudo  observations  also  have  diagnostic  value 

f\j 

for  data  analysis.  A  plot  of  y  against  y  shows  the  effect  of  "robustlf Icatlon" 
on  the  data.  This  Is  Illustrated  In  the  example  of  Section  7. 

3.4  Aligned  Tests 


The  aligned  tests  are  most  conveniently  described  for  the  case  H  ■ 
(0,  I)  where  I  is  the  q  x  q  Identity.  The  model  (2.4)  Is  partitioned  as 


Y  -  +  X282  +  e  (3.11) 

where  X^  and  X2  are  of  order  nx(p  -  q)  and  n  x  q  while  and  B2  are  (p  -  q) 

X  1  and  q  X  1  vectors,  respectively.  The  null  hypothesis  (3.1)  becomes 
B2  =  0  where  Bj^  Is  treated  as  a  nuisance  vector. 

The  least  squares  F  test  can  be  derived  by  first  removing  the  effects 
of  the  nuisance  vector  Bj^  by  projecting  both  y  and  the  columns  of  X2  onto 
.  Hence  consider  ~  y  ~  y  ^^•*■^2  *  ^2  ~  ^2  ^  ” 

X.  (Xj^'X,)~^X  'y.  Now  using  ^^^1X2  as  the  design,  project  the  vector 

o  o 

to  construct  the  F  test  for  H  :  6~  =  0.  The  numerator  in  this  case  is  the 

o  i 

squared  length  of  this  projection  and  the  resulting  form  of  the  F  statistic  is 


F 


-1., 


(Pjj^y)'X2(X2’Pjj^X2)  *X2*(Pj^^y) 


qa 


(3.12) 


This  can  be  derived  algebraically  from  (3.3)  and  expresses  the  numerator 
as  a  quadratic  form  in  the  reduced  model  residuals.  Draper  and  Smith 
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(1966,  Section  4.1)  discuss  this  method  for  fitting  a  regression  equation 
with  two  Independent  variables  as  a  sequence  of  simple  regression  fits. 

ys  A 

Now  let  9^  «  denote  the  reduced  model  rank  estimate.  Hence  (similar 

to  (2.9)) 

a'*‘(R|y  -  9^1)  e  (3.13) 

Using  the  numerator  In  (3.12)  we  replace  the  reduced  model  residuals  by 
the  reduced  model  signed  ranks  of  the  residuals  and  define 

Q  -  [a'^(R|y  -  9^1)] 'X2(X2’Pj^iX2)‘^X2'a'^(R|y  -  9^  | ) .  (3.14) 

o 

The  vector 

S2  =  X2’a'^(R|y  -  9^1)  (3.15) 

Is  called  the  vector  of  aligned  rank  statistics  and 

Q  =  S2'(X2'P^jX2)’^S2  (3.16) 

“  o 

Is  called  the  aligned  rank  test  statistic.  See  Hodges  and  Lehmann  (1962) . 

The  results  of  Sen  and  Purl  (1977)  when  specialized  to  the  univariate 
linear  model  show  that  Q  has  an  approximate  chi  squared  distribution  with 
q  degrees  of  freedom.  Hence  the  asymptotic  test  of  H^:  $2  “  ^  rejects 
when  Q  >  x^(a,  q) 

We  next  derive  an  equivalent  rank  test.  From  (3.13)  we  can  replace 

A  ^ 

a'^CRly  -  9^1)  by  P^j^a'*’(R|y  "9^1)  in  (3.14).  Using  the  matrix  identity 
o 


(3.17) 
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we  can  write  Q  in  (3.16)  as 

Q  -  [a'^CRjy  -  '  P^^la+CRjy  -  9^|).  (3.18) 

o 

This  version  of  Q  Is  introduced  and  discussed  in  detail  by  Adichie  (1978). 

/V 

In  his  discussion  the  estimate  6  need  not  be  a  rank  estimate  to  obtain 

o 

the  as3rmptotic  distribution.  To  our  knowledge  no  study  has  been  made  of 
the  sensitivity  of  Q  to  the  type  of  estimate  used  for  6^.  Since  Q  is  a 
rank  test  it  would  seem  most  natural  to  use  a  rank  estimate  and  under  this 
condition  the  aligned  rank  test  (3.14)  is  equivalent  to  Adlchle's  test 
(3.18).  For  the  remainder  of  this  paper  the  term  aligned  rank  tests  will 
refer  to  either  construction. 

All  four  tests:  (3.4),  (3.7),  (3.14)  and  (3.18)  are  very  similar 
in  their  asymptotic  properties.  They  all  have  an  asymptotic  chi  squared 
distribution  under  the  null  hypothesis  and  they  have  the  same  asymptotic 
efficiency.  We  have  further  shown  in  this  section  that  though  they  are 
not  algebraically  identical,  each  is  closely  associated  with  one  of  the 
various  equivalent  forms  of  the  F  test.  In  Section  5  we  consider  some  of 
the  small  sample  power  properties  via  Monte  Carlo  simulations. 
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4.  Computation 


The  R-estlmates,  predicted  values,  and  hence  minimum  distances  can 
be  obtained  by  the  k-step  procedure  discussed  by  McKean  and  Hettmansperger 
(1978a).  It  Is  an  algorithm  based  on  approximating  the  dispersion  function 
I |y  -  X3| by  a  quadratic  function.  For  a  general  design  matrix  X  the  kth 
step  estimate  of  6  Is 


while  for  the  full  rank  case  the  estimate  of  3  Is 

g(k)  ^  gCk-l)  +  T^*^“^^(X'X)~^V|  |y  -  X3^^~^^|| 

-  g(k-l)  ^  x^^"^)(X'X)"^X'a'*’(R|y  -  X3^'*^”^^|). 


(4.1) 


In  general  this  algorithm  will  not  converge  and,  hence,  to  obtain  fully 

Iterated  estimates  algorithms  such  as  steepest  descent  must  be  used.  The 

asymptotic  distribution  theory  for  Inference  based  on  these  k-step  estimates, 

however.  Is  the  same  as  that  of  the  fully  Iterated  estimates.  Furthermore 

In  a  simulation  study  by  the  authors  (1978a),  small  sample  results  were 

practically  the  same  for  estimates  of  2  or  3  steps  as  that  of  the  fully 

Iterated  ones.  For  starting  values  resistant  to  outliers,  we  recommend 

)l^-estlmates.  Recent  algorithms  such  as  Barrodale  and  Roberts  (1973)  and 

Bartels  and  Conn  (1980)  make  1^-starts  computationally  feasible. 

(k) 

Note  from  (4.1),  that  the  k-step  residuals,  r  ,  can  be  written  as 

,(k)  .  ^ck-i)  .  p^(;(k-i),+(jl^<k-i)|„ 

where  Is  the  projection  transformation  onto  Hence  a  convenient,  and 
numerically  stable  algorithm.  Is  based  on  first  obtaining  a  QR-  decomposi¬ 
tion  of  the  design  matrix  X,  using,  for  Instance,  the  collection  of 
algorithms  LINFACK  by  Dongara  et  al.  (1979).  A  QR-decomposltlon  results 
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In  an  orthonormal  basis  for  (2  from  which  projections  can  readily  be  formed. 
Another  advantage  is  that  the  design  matrix  X  need  not  be  of  full  column 
rank.  For  fully  Iterated  estimates,  steepest  descent  in  terms  of  an  ortho- 
normal  basis  is  simply  a  search  along  the  direction  specified  by 

^(Ir)  ''(’k-l) 

9''  '  _  9'  Finally,  reduced  model  design  matrices  for  natural  hypotheses 

of  the  form  H3  ~  0  can  readily  be  obtained  using  QR-decompo sit ions;  see 
McKean  and  Schrader  (1981). 

A  consistent  estimate  of  T  proposed  by  the  authors  (1976),  (1978a),  is 
T  «  (n^^^(U  -  L)/2Z^/2>tn/(n  "  (4.2) 

where  ^a/2  is  the  (1  -  a/2)  percentile  point  of  the  standard  normal  distri- 

“1/2 

but ion  and  U  and  L  are  solutions  C  of  the  equations  n  5:a+(R|r^  -  c|)  A  Z 
for  Z  «  ”^ct/2  ^a/2*  ^^®sp®ctively.  In  the  case  of  Wilcoxon  scores  U 

and  L  are  ordered  Walsh  averages  of  the  residuals.  Even  in  this  case, 
though,  the  equations  are  solved  by  an  iterative  algorithm  similar  to  the 
one  discussed  by  McKean  and  Ryan  (1977). 

As  the  authors  (1977)  discussed,  small  sample  corrections  are  necessary 
for  this  estimation  of  x.  One  such  correction  is  given  by  the  term  in  the 
brackets  of  (4.2)  which  corresponds  to  the  usual  least  squares  degree  of 
freedom  correction  for  the  estimate  of  variance.  Another  correction  which 
has  proven  useful  in  the  Wilcoxon  case  is  to  modify  the  Z  in  the  above 
equations  so  as  to  eliminate  the  p  smallest,  in  absolute  value,  ordered 
Walsh  averages.  A  similar  idea  was  used  by  Hill  and  Holland  (1977)  for 
a  scale  estimate  based  on  L^-reslduals.  The  estimate  of  t  used  in  Sections 
5  and  7  employed  both  of  these  small  sample  corrections  with  “  1.645. 
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5.  Monte  Carlo  Study 

As  discussed  In  Section  3  the  different  rank  tests  have  the  same 
asymptotic  null  distribution  and  relative  efficiency.  Monte  Carlo  simula¬ 
tions  are  needed  to  Irvestlgate  their  small  sample  behavior.  Simulation 
studies  by  the  authors  (1977),  (1978a),  (1978b)  have  Indicated  that  for 
the  statistic  D,  (3.4),  the  probability  of  a  type  I  error  Is  quite  stable 
over  a  variety  of  underlying  distributions.  This  stability  Is  confirmed 
In  the  study  discussed  below.  Small  sample  properties  of  the  rank  test 
B,  (3.7),  and  the  aligned  rank  teats,  (3.14)  and  (3.18),  have  not  previously 
been  Investigated. 

Before  turning  to  the  comparative  simulation,  we  will  briefly  discuss 
two  recent  simulations  of  related  aligned  rank  tests  proposed  by  Sen  (1969) 
and  Adlchle  (1974)  for  testing  parallelism  of  several  regression  lines. 

Sen  ranks  within  the  Individual  data  sets  and  avoids  estimating  the  Inter¬ 
cept;  Adlchle  ranks  the  combined  data  set  but  assumes  the  intercepts  are 
all  equal  and  avoids  estimation  of  the  common  Intercept. 

Lo,  Slmkln,  and  Worthley  (1978)  did  a  simulation  study  of  these  tests 
of  parallelism  for  the  case  of  equal  Intercepts.  The  aligned  rank  tests 
performed  uniformally  worse  than  the  least  squares  F-test  on  the  distribu¬ 
tions  considered.  The  power  of  the  aligned  tests  was  always  low  and  In 
some  cases  almost  nonexistent. 

Smlt  (1979)  performed  a  similar  simulation  study.  His  results  also 
Indicated  that  least  squares  generally  performed  better  than  the  aligned 
tests.  There  Is,  however,  one  glaring  difference  with  the  first  study. 
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In  several  cases  the  F  test  and  Sen's  test  had  very  similar  power  but 
Adlchle's  test  had  little  or  no  power.  A  careful  comparison  of  the  two 
studies  yields  contradictory  conclusions.  In  the  first  study  the  aligned 
rank  tests  behaved  similarly  and  were  Inferior  to  the  F  test.  In  the 
second.  Sen's  test  and  the  F  test  often  behaved  similarly  and  were  superior 
to  Adlchle's  test. 

The  aligned  tests  described  In  Section  3  were  developed  for  the  general 
linear  hypotheses,  quite  analogous  to  least  squares,  and  differ  from 
these  earlier  tests.  With  the  above  studies  In  mind,  we  decided  to  In¬ 
vestigate  the  tests  discussed  In  Section  3  on  essentially  the  same  model 
as  considered  by  Lo,  Simkln,  and  Worthley  (1978):  three  regressions  with 
unconstrained  Intercepts,  equal  sample  sizes,  and  unlforml  '  spaced  x's. 

We  considered  sample  sizes  of  5  and  10,  hence,  total  sample  sizes  of  15 
and  30.  The  dimension  of  H  for  this  design  is  6;  the  ratios  of  observations 
to  parameters  are  2.5  and  5.  We  label  this  design  A. 

A  second  parallel  regression  problem,  design  B,  consists  of  two  regres¬ 
sions  with  common  Intercepts  and  x's  placed  at  1,  2,  3,  4,  5,  10  for  the 
first  sample  and  at  7,  8,  9,  10,  11,  12  for  the  second.  This  design  con¬ 
tains  a  point  of  moderate  leverage  corresponding  to  the  point  10  in  the 
first  sample.  This  Is  a  valuable  point  of  the  design  and  should  not  be 
confused  with  points  of  leverage  combined  with  discrepant  observations, 
see  Hoaglln  and  Welsch  (1978).  Least  squares  and  the  rank  tests  D  (3.4) 
and  B  (3.7)  are  not  affected  by  this  design  but,  as  shown  below  and  discussed 
In  the  next  section,  the  aligned  tests  are  adversely  affected.  The  obser¬ 
vation-parameter  ratio  for  this  design  Is  12  to  3.  As  In  the  first 
design,  we  are  testing  for  parallelism. 


In  order  to  study  the  performance  of  these  tests  on  both  moderate  and 
heavy  tailed  error  structure,  we  selected  normal  and  Cauchy  errors.  With 
Lo,  Simkln  and  Worthley's  study  In  mind,  we  also  Included  the  double 
exponential  distribution  for  design  A.  The  normals  were  obtained  using 
the  transformation  proposed  by  Marsaglla  and  Bray  (1964)  on  a  pair  of 
uniform  variates.  The  uniforms  were  generated  by  the  algorithm  UNI  developed 
by  Gross  (1976).  The  double  exponential  and  the  Cauchy  observations  were 
generated  in  the  form  normal  over  an  independent  variable  as  noted  in 
Simon  (1976).  The  tests  simulated  were  F,  (3.3);  D,  (3.4);  B,  (3.7); 
and  q,  (3.14).  Both  signed  rank  and  rank  scores  were  used.  The  results 
are  all  based  on  500  simulations. 

Empirical  5%  and  10%  levels  for  the  test  statistics  on  all  the  designs 
and  distributions  are  displayed  in  Table  5.1.  Least  squares  (3.3)  and  the 
rank  tests  based  on  drop  in  dispersion  (3.4)  are  fairly  stable  over  almost 
all  the  situations.  The  tests  based  on  R-pseudo-observations,  (3.7), 
appear  to  be  conservative  for  the  small  sample  sizes  in  the  normal  model 
and  tend  to  be  liberal  for  Cauchy  errors  on  design  A  with  n^  =  10  and 
design  B.  This  behavior  for  the  statistic  B  on  Cauchy  errors  confirms 
similar  findings  on  tests  derived  from  M-pseudo-observations  (Schrader  and 
Hettmansperger  (1980)).  It  is  also  predictable  in  the  light  of  robust 
regression  estimators  which  seem  to  have  larger  small  sample  variances  than 
their  asymptotic  counterparts  for  heavy  tailed  error  structures;  see  Huber 
(1973)  and  McKean  and  Hettmansperger  (1978a).  Small  sample  corrections 
for  the  tests  based  on  pseudo-observations  seem  to  need  some  measure  of 
tall  weight  of  the  underlying  error  structure. 

-  Table  5.1  about  here  - 
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The  null  levels  for  the  aligned  tests  are  more  eratlc.  They  are 
quite  liberal  for  the  small  sample  size  on  design  A.  This  Improves  some¬ 
what  for  the  larger  sample  size.  For  design  B  they  seem  to  be  conserva¬ 
tive,  especially  the  aligned  test  based  on  rank  scores.  As  shown  In  Table 
5.4,  their  power  was  adversely  affected  by  this  design;  for  Instance, 
the  least  squares  test  Is  as  powerful  on  Cauchy  errors  as  the  aligned 
tests  based  on  rank  scores.  The  aligned  rank  tests  exhibited  even  worse 
behavior  for  other  designs  which  Included  points  of  moderate  leverage. 

A  partial  explanation  of  this  behavior  Is  found  In  the  next  section. 

Simple  small  sample  corrections,  such  as  using  F-crltlcal  values, 
that  would  Improve  the  behavior  of  the  aligned  tests  on  design  A,  would 
be  quite  detrimental  to  their  behavior  on  design  B.  Due  to  their  sensitivity 
to  design,  small  sample  corrections  for  the  aligned  rank  tests  appear  to 

be  more  complicated  than  those  for  the  tests  based  on  the  drop  In  dispersion 
or  pseudo-observations.  Corrections  for  the  aligned  tests  need  more  In¬ 
formation  involving  the  design  matrix. 

The  results  of  the  power  study  for  the  tests  at  level  .05  appear  In 
Tables  5.2  -  5.4.  The  alternatives  were  selected  separately  for  each  distri¬ 
bution  In  order  to  achieve  a  reasonable  range  of  powers.  For  valid  compari¬ 
sons,  the  empirical  levels  should  be  close  to  .05.  Since  this  Is  true  for  the 
least  squares  test  and  Che  rank  test  based  on  D,  (3.4),  these  tests  can  be 
compared  (ocher  chan  LS  at  Cauchy  errors  on  design  B) .  For  all  designs,  least 
squares  dominates  on  normal  errors,  while  D  dominates  on  Cauchy  errors.  On 
double  exponential  errors  note  Chat  least  squares  Is  slightly  more  powerful 
for  samples  of  size  5,  whereas  D  is  more  powerful  for  the  samples  of  size  10. 
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The  test  B  based  on  R-pseudo-observations  is  slightly  dominated  by  the 
test  D. 


-  Tables  5.2  -  5.4  about  here. 


When  comparable  (design  A,  n^  =  10,  double  exponential  or  Cauchy  errors) 
the  power  results  of  the  aligned  tests  are  similar  to  D.  Certainly  the 
results  on  aligned  tests  are  much  improved  over  the  earlier  tests  of 
Sen  (1969)  and  Adichle  (1974)  considered  by  the  two  studies  mentioned 
above.  The  adverse  behavior  of  aligned  tests  on  design  B  was  noted 
above.  The  results  for  the  rank  tests  other  than  the  aligned  tests  seem 
to  be  similar  for  rank  or  signed-rank  scores.  Neither  type  of  score 
dominated  the  other. 

Our  general  recommendation  is  to  use  the  WLAD  or  rank  estimate  3 
along  with  D.  This  approach  combines  the  estimate  which  minimizes 
a  data  fitting  criterion,  with  the  test  that  considers  the  reduction  in 
D  due  to  fitting  the  various  models  under  consideration.  The  asymptotic 
theory  and  corresponding  small  sample  adjustments  combine  to  provide  an 
effectively  distribution  free,  robust  test,  and  a  robust  estimate.  The 
level  and  power  of  D  appear  quite  stable  over  a  variety  of  underlying  error 
distributions. 
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6.  The  Effect  of  Leverage  In  the  Design 


In  order  to  understand  the  behavior  of  the  test  statistics  on  the 
design  containing  a  point  of  moderate  leverage,  consider  a  simple  model 
containing  two  predictor  variables  with  observations  taken  at  the  points 

X2)  as  shown  In  Table  6.1.  The  point  (0,  x)  Is  an  extreme  point 
-  Table  6.1  about  here  - 

In  the  X2-dlrectlon.  Let  y  denote  the  observation  corresponding  to  (0,  x) . 
Consider  the  full  model  Y  *  ■*“  ®2^2  ^  hypothesis  ”  0 

versus  $2  0* 

If  y  Is  on  the  surface  then  It  Is  a  valuable  point  In  determining  the 
fit.  The  reduced  model  residual  at  y  will  be  large  and  Its  value  Incorporated 
Into  F  and  D;  Q  however  down  weights  the  residuals  and  consequently  loses 
power. 

In  order  to  see  this  consider  a  perfect  model:  Y  =  X.  +  X2  +  0;  that 
Is  all  the  points  lie  on  the  surface.  Since  the  denominators  of  F  and  D 
are  then  zero,  we  will  only  consider  their  numerators.  A  straight  forward 
calculation  shows 


3(3xy  +  20) 

“  10 (3x^  +  20) 


3(3. 78x  +  18.89)^ 
^  ”  10(3x^  +  20) 


(6.2) 


In  computing  Q  we  supposed  that  y  >  1  so  that  the  reduced  model  residual 
has  rank  10.  When  y  Is  on  the  surface  we  have  y  >  x.  As  x  Increases,  the 
leverage  Increases,  F  also  Increases  but  Q  decreases.'  In  fact  Q  may 


23 


decrease  below  Che  critical  value  and  fail  to  detect  ^2  ^ 

formulas  (6.1)  and  (6.2)  continue  to  hold  when  y  is  not  on  Che  surface. 

In  (4.2)  we  see  that  Q  is  only  dependent  on  y  through  the  rank.  Again 
when  y  -  X  since  Che  full  model  fit  is  perfect,  the  full  model  dispersion 
is  0  and  D  only  depends  on  Che  reduced  model  dispersion.  Now  for  the 
example,  when  y  •  x  and  x  >  0,  the  numerator  of  D  is  (for  rank  scores), 

D  -  2(5.67  +  1.42y)  (6.3) 

Here  x  does  not  enter  the  formula  since  it  does  not  appear  in  Che  reduced 
model.  Note  Chat  D  Increases  with  y  as  the  leverage  increases. 

This  simple  example  provides  some  explanation  for  the  adverse  effect 
of  moderate  leverage  on  aligned  tests  noted  in  the  simulations.  It  further 
suggests  Chat  the  other  tests  will  not  be  so  effected. 
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7.  An  Example 

We  consider  a  3  x  4  factorial  experiment  discussed  by  Box  and  Cox 
(1964).  An  experiment  was  carried  out  on  48  animals  to  study  the  relative 
effectiveness  of  3  poisons  and  4  treatments.  There  were  4  animals  at  each 
poison  X  treatment  combination  and  the  data  consists  of  the  48  survival 
times.  The  data  Is  reproduced  In  Table  7.1 

-  Table  7.1  about  here  - 
The  least  squares  AOV  Is  given  In  Table  7.2. 

-  Table  7.2  about  here  - 

Note  that  the  F  test  for  Interaction  falls  to  achieve  significance  at 
the  5%  level.  A  glance  at  a  plot  of  the  cell  means  shows  that  there  are 
crossing  means  In  Poisons  1  and  2  while  Poison  3  Is  almost  Indifferent  to 
which  treatment  Is  applied.  These  sorts  of  patterns  are  highly  suggestive 
of  Interaction.  See  Brown  (1975).  If  we  plot  the  standardized  full  model 
residuals  against  the  full  model  predicted  values,  a  fan  shaped  plot  appears. 
See  Figure  7.1. 

-  Figure  7.1  about  here  - 

There  is  clear  heterogeneity  of  cell  variances  and  the  four  circled  observa¬ 
tions  are  noted  for  their  large  standardized  residual.  Note  the  residuals 
were  standardized  by  dividing  by  the  pooled  estimate  of  their  standard 
deviations. 

We  now  consider  how  a  parallel,  robust  analysis  based  on  the  ranks  of 
the  residuals  can  enhance  the  data  analysis.  We  will  use  Wllcoxon  scores. 
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A  robust  AOV  table  based  on  (3.4)  is  shovm  In  Table  7.3. 

-  Table  7.3  about  here  - 

Note  the  test  for  Interaction  is  now  significant  at  the  5%  level. 

The  pseudo-observations  (3.8)  provide  some  insight  into  how  the  rank 
tests  are  operating  on  the  data.  In  Figure  7.2  we  plot  the  pseudo-observations 
against  the  actual  observations.  The  4  observations  with  large  standardized 
residuals  are  marked.  Notice  how  they  are  "brought  back  in." 

-  Figure  7.2  about  here  - 

In  Figure  7.3  we  plot  the  standardized  residuals  of  the  pseudo-obser¬ 
vations  against  the  robust  predicted  values.  There  are  no  longer  any  extreme 
standardized  residuals. 

-  Figure  7.3  about  here  - 

Further,  it  can  be  seen  how  the  ranking  process  is  working  to  equalize 
the  cell  standard  deviations.  Compare  Figures  7.1  and  7.3. 

Table  7.3  was  based  on  the  drop  in  dispersion  (3.4).  If  the  pseudo- 
observations  were  used  with  a  least  squares  program  then  a  similar  table 
based  on  (3.7)  could  be  constructed.  The  results  are  quite  similar  for  the 
two  approaches. 
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TABLE  5.1 


EMPIRICAL  LEVELS 
PARALLEL  DESIGN  A(n^  -  5) 


NOMINAL 

LS 

D 

B 

Q 

DIST 

LEVEL 

SR 

R 

SR 

R 

SR 

R 

05 

050 

036 

036 

032 

030* 

076'*’ 

056 

NORMAL 

10 

098 

080 

070 

056" 

048" 

182''' 

142* 

DEXP 

05 

060 

048 

044 

040 

046 

094"^ 

076* 

10 

100 

096 

082 

074“ 

078 

178^ 

148* 

05 

0A2 

044 

036 

060 

050 

098^ 

068 

CAUCHY 

10 

114 

108 

082 

096 

086 

200"^ 

160* 

PARALLEL 

DESIGN  i 

■  10) 

NORMAL 

05 

048 

058 

056 

046 

042 

064 

058 

10 

090 

098 

094 

094 

082 

126'*' 

120 

05 

038 

050 

048 

058 

050 

050 

034 

DEXP 

10 

074" 

098 

094 

090 

088 

120 

100 

05 

062 

076'*’ 

062 

09  4'*’ 

096'*' 

068 

064 

CAUCHY 

10 

116 

138^ 

128'*’ 

154’'’ 

138'*’ 

142'*’ 

132* 

PARALLEL 

DESIGN 

B(n^  -  5) 

05 

048 

034 

032 

030" 

028" 

034 

016" 

NORMAL 

10 

088 

072 

068" 

060" 

056" 

100 

090 

05 

092"^ 

062 

046 

104'*’ 

102'*' 

030" 

024" 

CAUCHY 

10 

110 

122 

110 

128'*' 

130'*’ 

102 

078 

Note: 

A  -(+)  is  attached  if 

the  empirical  level  is  belov(above) 

(.031, 

.070) 

(.077,  .123), 

Che  95Z 

intervals 

around 

.05  and  . 

10  respectively. 

TABLE  5.2 

EMPIRICAL  POWER  FOR  .05  TESTS 
PARALLEL  DESIGN  A(n^  -  5) 


LS 

D 

B 

SR 

R 

SR 

R 

NORMAL 

NULL 

050 

036 

036 

032 

030“ 

1 

126 

098 

084 

076 

068 

2 

372 

298 

276 

224 

214 

ALTS 

3 

646 

592 

546 

472 

452 

4 

976 

942 

930 

870 

860 

5 

1.000 

998 

1.00 

998 

1.00 

DEXP 

NULL 

060 

048 

044 

040 

046 

1 

094 

072 

070 

058 

054 

2 

220 

192 

174 

156 

142 

ALTS 

3 

406 

376 

340 

298 

284 

4 

782 

758 

730 

678 

668 

5 

974 

972 

958 

930 

930 

CAUCHY 

NULL 

042 

044 

036 

060 

050 

TABLE  5.3 

EMPIRICAL  POWER  FOR  .05  TESTS 
PARALLEL  DESIGN  A(n^  -  10) 


NORMAL 

NULL 

048 

SR 

058 

R 

056 

SR 

046 

R 

042 

SR 

064 

R 

058 

1 

078 

082 

076 

066 

064 

096 

086 

2 

450 

432 

398 

364 

374 

476 

452 

ALTS 

3 

888 

884 

858 

836 

832 

904 

874 

4 

994 

990 

990 

990 

984 

990 

990 

5 

1.000 

998 

998 

998 

998 

998 

998 

DEXP 

NULL 

042 

050 

048 

058 

050 

050 

034 

1 

134 

164 

150 

148 

142 

176 

152 

2 

424 

530 

504 

486 

476 

536 

506 

3 

772 

832 

814 

784 

800 

838 

794 

4 

986 

984 

984 

990 

990 

988 

986 

5 

998 

998 

996 

1.000 

1.000 

994 

992 

CAUCHY 

NULL 

062 

076''’ 

062 

094''' 

096''' 

068 

064 

1 

070 

096 

086 

126 

112 

124 

090 

2 

146 

400 

386 

376 

380 

410 

390 

3 

248 

738 

734 

744 

734 

710 

678 

4 

380 

894 

892 

904 

902 

844 

824 

5 

480 

958 

960 

964 

966 

912 

908 

TABLE  5.4 

EMPIRICAL  POWER  FOR  .05  TESTS 
PARALLEL  DESIGN  B 


LS 

D 

B 

Q 

SR 

R 

SR 

R 

SR 

R 

NORMAL 

NULL 

048 

034 

032 

030" 

028' 

034 

016" 

1 

252 

196 

178 

154 

150 

204 

124 

2 

624 

486 

424 

422 

428 

448 

328 

ALTS 

3 

894 

790 

760 

760 

764 

684 

546 

4 

990 

974 

954 

936 

930 

842 

718 

5 

996 

988 

986 

982 

974 

884 

768 

CAUCHY 

NULL 

092“^ 

062 

046 

104'*’ 

102'*’ 

030" 

024' 

1 

350 

460 

428 

410 

406 

430 

300 

2 

634 

812 

780 

774 

772 

720 

572 

ALTS 

3 

696 

858 

852 

846 

844 

772 

660 

4 

750 

902 

884 

886 

888 

818 

722 

5 

798 

926 

922 

924 

926 

868 

778 

TABLE  6.1 


design  with  high  leverage  point 


-1-1-11110  0 
-1  0  1-10  1-10 


TABLE  7.1 


Survival  times  (unit,  10  hr)  of  animals  in  a  3  x  4  factorial  experiment 


Treatment 


Poison 


1 


II 


III 


A 

B 

C 

0 

031 

082 

043 

045 

045 

110 

045 

071 

046 

088 

063 

066 

043 

072 

076 

062 

036 

092 

044 

056 

029 

061 

035 

102 

040 

049 

031 

071 

023 

124 

040 

038 

022 

030 

023 

030 

021 

037 

025 

036 

018 

038 

021 

031 

023 

029 

022 

033 

TABLE  7.3 


"AOV" 


SOURCE 

df 

D 

D/df 

TREAT. 

3 

2.9 

.97 

20.9 

POISON 

2 

3.6 

1.8 

38.8 

T  X  P 

6 

.85 

.14 

3.1 

ERROR 

36 

.046 

ERROR  *  f/2  IS  COMPUTED  FROM  THE  FULL  MODEL 


i  FIGURE  7.1 

STANDARDIZED  RESIDUALS  VS.  PREDICTED  VALUES 
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FIGURE  7.2 

PSEUDO-OBSERVATIONS  VS.  OBSERVATIONS 
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FIGURE  7.3 

STANDARDIZED  RESIDUALS  VS.  PREDICTED  VALUES 
BASED  ON  PSEUDO-OBSERVATIONS 


1.804 


0.f04 


«  « 

•  • 


0.0  4  4 


«.»04 


t 

* 


« 

«  « 
« 

2  * 


1.104 


4— . 4 - - 

0.19  0.30  0.49 


•  • 
*  •  • 

« 


« 


•  • 
•  ( 

•  •  • 


- - - 

0.40  0.75  0. 


PREDICTED  value 


Unclassified 


SSCUWITV  classification  of  this  FAOC  riFf>«n  0«a  tnnnd) 


REPORT  DOCUMENTATION  PAGE 

READ  INSTRUCTIONS 

BEFORE  COMPLETING  FORM 

1.  NCFOMT  NUMOen  2.  COVT  ACCESSION  NO. 

Penn  State  Tech.  Rpt.  #36  /^7)—  /■)  Q  7 

2.  NUMBER 

4.  TITLC  rana  StiMilt) 

A  Geometric  Interpretation  of  Inferences 

Based  on  Ranks  In  the  Linear  Model 

s.  tyre  of  rerort  a  rerioo  covered 

t.  RERFORMINO  ORG.  RERORT  NUMBER 

7.  AUTHOn<a> 

Thomas  P.  Hettmansperger,  Penn  State  University 
Joseph  W.  McKean,  Western  Michigan  University 

•.  CONTRACT  OR  GRANT  NUMBERTcI 

N00014-80-C-0741 

»■  FCNFONMINS  ONOANIZATION  NAME  ANO  ADDRESS 

Department  of  Statistics 

Penn  State  University 

University  Park,  PA  16802 

10.  RROGRAM  ELEMENT,  RROJECT,  TAM 
AREA  A  WORK  UNIT  NUMBERS 

NR  042-446 

n.  CONTROLLING  OFFICE  NAME  ANO  AOORESS 

Office  of  Naval  Research 

Statistics  and  Probability  Program  Code  436 
Arlington,  VA  22217 

IZ.  RERORT  DATE 

July  1981 

12.  NUMBER  OF  rages 

39 

>4.  monitoring  AGENCY  name  S  AOOResS<’/<  <0//a/anf /raai  CanmilMS  Of/lea^ 

19.  SeCUMiTV  CLASS,  (ol  tMa  report) 

Unclassified 

1S«.  OCCLASSIPICATIOM/OOWNCRAOINO 
SCHEDULE 

OlSTRiauTtON  STATCMCNT  ('ol  (I1I4 

APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  UNLIMITED. 

17.  DISTRIBUTION  STATEMENT  (ot  (lia  aSairacI  anlara.  In  Black  20,  II  dlllaranl  Naai  Raaart; 

It.  su^flemcntaay  notes 

19.  KEY  WORDS  rC«n«lmf«  on  1/  br  block  numbmr) 

R-estlmates,  rank  tests,  robust  Inference,  nonparametrlc  tests 

-^our  different  approaches  to  testing  and  estimation  based  on  ranks  are 
unified  through  the  geometry  of  the  linear  model.  The  various  tests  are 
Identified  with  various  algebraically  equivalent  forms  of  the  classified 
F-statlstlc.  Small  sample  differences  are  Investigated  via  a  Monte  Carlo 
study  using  both  rank  and  signed  rank  tests.. 

DO  1473/. 


coition  of  1  NOV  fS  IS  OCSOLCTC 

S.  N  0102- LF- OU-  6401 


Unclassified 


SCCUNITV  CLASSIFICATION  OF  THIS  FAOC  (Man  Oaia  CniaraN) 


